Perceptual learning of degraded speech by minimizing prediction error.
Abstract
Human perception is shaped by past experience on multiple timescales. Sudden and dramatic changes in perception occur when prior knowledge or expectations match stimulus content. These immediate effects contrast with the longer-term, more gradual improvements that are characteristic of perceptual learning. Despite extensive investigation of these two experience-dependent phenomena, there is considerable debate about whether they result from common or dissociable neural mechanisms. Here we test single- and dual-mechanism accounts of experience-dependent changes in perception using concurrent magnetoencephalographic and EEG recordings of neural responses evoked by degraded speech. When speech clarity was enhanced by prior knowledge obtained from matching text, we observed reduced neural activity in a peri-auditory region of the superior temporal gyrus (STG). Critically, longer-term improvements in the accuracy of speech recognition following perceptual learning resulted in reduced activity in a nearly identical STG region. Moreover, short-term neural changes caused by prior knowledge and longer-term neural changes arising from perceptual learning were correlated across subjects with the magnitude of learning-induced changes in recognition accuracy. These experience-dependent effects on neural processing could be dissociated from the neural effect of hearing physically clearer speech, which similarly enhanced perception but increased rather than decreased STG responses. Hence, the observed neural effects of prior knowledge and perceptual learning cannot be attributed to epiphenomenal changes in listening effort that accompany enhanced perception. Instead, our results support a predictive coding account of speech perception; computational simulations show how a single mechanism, minimization of prediction error, can drive immediate perceptual effects of prior knowledge and longer-term perceptual learning of degraded speech.
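The single-mechanism idea in the abstract can be illustrated with a toy sketch. This is not the authors' computational simulation: the two-word lexicon, the four feature channels, the uniform attenuation used as "degradation", the scalar `gain` parameter, and the learning rate are all assumptions made for illustration. The sketch only shows how one quantity, prediction error, can fall immediately when a matching prior is available and fall gradually as the listener adapts to the degradation.

```python
import numpy as np

def squared_error(sensory, prediction):
    """Sum of squared prediction errors across feature channels."""
    return float(np.sum((sensory - prediction) ** 2))

# Hypothetical two-word lexicon over four feature channels (assumption).
lexicon = np.array([[1.0, 0.0, 1.0, 0.0],
                    [0.0, 1.0, 0.0, 1.0]])
spoken = lexicon[0]
degraded = 0.6 * spoken                 # assumed degradation: uniform attenuation

matching_prior = spoken                 # matching text identifies the word
neutral_prior = lexicon.mean(axis=0)    # no informative text: average of the lexicon

def run(n_trials=30, lr=0.1):
    """Track prediction error while a scalar gain adapts to the degradation."""
    gain = 1.0                          # listener initially expects clear speech
    history = []
    for _ in range(n_trials):
        # Prediction error under each prior with the current gain.
        history.append((squared_error(degraded, gain * matching_prior),
                        squared_error(degraded, gain * neutral_prior)))
        # Gradient step that reduces squared error on a matching-prior trial.
        residual = degraded - gain * matching_prior
        gain += lr * float(residual @ matching_prior)
    final = (squared_error(degraded, gain * matching_prior),
             squared_error(degraded, gain * neutral_prior))
    return gain, history, final

gain, history, final = run()
print("first trial (matching, neutral):", history[0])
print("after learning (matching, neutral):", final)
print("learned gain:", gain)
```

On the first trial the matching prior already yields a smaller error than the neutral prior (about 0.32 versus 0.52), which stands in for the immediate effect of prior knowledge. After the gain has adapted toward 0.6, the error is lower under both priors (near 0 and about 0.36), which stands in for the slower effect of perceptual learning.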
Similar resources
Effect of MMSE-STSA Algorithm in CELP and MELP Speech Coders
The role of speech coding is to reduce the bit rate while maintaining good speech quality. To improve the perceptual quality of degraded speech, different speech enhancement methods can be used. It is therefore worthwhile to study joint systems (speech enhancement combined with low-bit-rate speech coders). The work reported in this paper shows the improvement in the perceptual quality of speech ...
L2 Learners’ Lexical Inferencing: Perceptual Learning Style Preferences, Strategy Use, Density of Text, and Parts of Speech as Possible Predictors
This study was intended first to categorize the L2 learners in terms of their learning style preferences and second to investigate if their learning preferences are related to lexical inferencing. Moreover, strategies used for lexical inferencing and text related issues of text density and parts of speech were studied to determine their moderating effects and the best predictors of lexical infe...
Perceptual learning of dysarthric speech: a review of experimental studies.
PURPOSE: This review article provides a theoretical overview of the characteristics of perceptual learning, reviews perceptual learning studies that pertain to dysarthric populations, and identifies directions for future research that consider the application of perceptual learning to the management of dysarthria. METHOD: A critical review of the literature was conducted that summarized and syn...
The Role of Facial Gestural Information in Supporting Perceptual Learning of Degraded Speech
The “Kiel Corpus of Read Speech” as a Resource for Prosody Prediction in Speech Synthesis
The naturalness of synthetic speech depends strongly on the prediction of appropriate prosody. For the present study the original annotation of the German speech database “Kiel Corpus of Read Speech” was extended automatically with syntactic features, word frequency, and syllable boundaries. Several classification and regression trees for predicting symbolic prosody features, postlexical phonol...
Journal:
- Proceedings of the National Academy of Sciences of the United States of America
Volume 113, Issue 12
Pages -
Publication date: 2016